|
|
Accession Number |
TCMCG039C24436 |
gbkey |
CDS |
Protein Id |
XP_024030743.1 |
Location |
complement(join(199399..199626,199800..200522,200719..201018,201169..201226,201951..202094,202204..202460,203210..203416,203675..203848,203956..204023,204739..204797,226328..226403,226487..226592,226692..226801,227049..227213,227429..227514,228058..228122,229774..229821,230345..230464,230593..230668,231718..231815,231937..232069,232547..232665)) |
Gene |
LOC21393035 |
GeneID |
21393035 |
Organism |
Morus notabilis |
|
|
Length |
1139aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA263939 |
db_source |
XM_024174975.1
|
Definition |
DNA mismatch repair protein MSH1, mitochondrial isoform X1 [Morus notabilis] |
CDS: ATGTACTGGTTGGCTACTCGAAACGCCGTCGTTTTCTCCTTGCATTGGCGTTCTCTCGCTCTTCTCCTTCGCTCTCCTCCTTGTAGATACAGCTCTTTCACTCCCTCTCCACTGCTCCTACCATTTGGGAGGATTTTTTGTTTTAAAGATCAGAGGATATTGAAAAGAAGCTTAAGAACTACAAGGAAAGTTAAGCCGTCGAACGATGTCTTGACCGAAAAGGAGCTTTCTAACATATTGTGGTGGAAAGAGAGGTTGCAAAACTGCAAAAAGCCTTCAACTGTCCAGCTAGTTAAAAGGCTTGAGTATTCAAATTTGTTAGGATTGGACGTCAACTTGAAAAATGGGAGTCTGAAAGAAGGAACACTCAACTGGGAGATGTTGCAGTTCAAGTCAAAGTTTGCTCGTGAGATTTTGCTCTGCAGAGTTGGGGAATTTTATGAAGCCATTGGGATAGATGCTTGCCTTTTAGTTGAATTTGCGGGTTTGAATCCTTTTGGGGGTCTGCGTTCAGATAGTATTCCAAGAGCTGGATGTCCAGTTATGAATCTTCGGCAAACTTTGGATGACCTGACACATAATGGATATTCGGTGTGCATAGTCGAAGAAGCTCAGGGTCCGACAAATGCTCGCTCTCGCAAAAGTCGTTTTATATCTGGGCATGCACACCCTGGTAGTCCTTATGTCTTTGGACTTGTTGGTCTTGATCATGATCTTGACTTTCCTGAACCAATGCCTGTTGTCGGGATATCTCGTTCTGCAAGAGGTTATTGCATAAATTTAGTCTTAGAGACTATGAAAACGTATTCATCTGAAGATGGTTTGACTGAAGAGGCCTTAGTCACAAAGCTTCGTACTTGTCGGCACCATCATCTGTTTTTGCATTCTTCGTTGAGGCACAATTCATCAGGCACTTGTCGTTGGGGAGAATTTGGGGAGGGAGGCTTACTGTGGGGGGAATGTACTGCCAGACATTTTGAATGGTTTGAAGGCAATCCTGTGACCGATCTTTTGGCTAAGGTTAAGGAGCTTTATGGTCTTGATGGAGAAGTTACATTTAGGAATGTTACCGTTACTTCAGAAAATAGGCCCCGGCCTTTAACCCTCGGAACAGCGACCCAAATTGGTGCCATACCAACCGAAGGAATACCTTGTTTGTTGAAGGTGTTGCTTCCTCCAAATTGTGCAGGTCTACCTGTGACTTACATTAGAGATCTTCTTCTTAATCCTCCCGCATATGAAATTGCATCCACAATTCAAGCAACGTGCAAACTTATGAGCAATGTGACTTGCTCGATTCCGGACTTTACCTGTGTTTCATCTGCAAAGCTTGTGAAGCTACTTGAACTAAGGGAGGCCAATCATATTGAGTTTTGTAGAATAAAAAACGTAGTGGATGAATTGTTGCTCATGACTAAAAATTCTGAGCTCAGTGAAATCCTGAAATTGTTGTTGGACCCTACGTGGGTGGCCACTGGCTTGAAAATTGATTTTGAGACATTAATTGATGAATGTGAATGGACTTCAAATAAAATTGGTGAAATGATCTCTCTAGATGGTGAGAGTGATCAAAAAATCAGTTCCTCTTCTATTGTTCCTGATGATTTCTTTGAGGATATGGAGTCGTCATGGAAAGGTCGTGTCAAGAAGGTCCATATAGGGGAAGAATTTGCAGCAGTGGAAAGGGCAGCTGAGGCTTTAACTCTAGCAGTTTCTGAAGCCTTCCTGCCCATCATTACAAGAATAAGGGCTACCACAGCCCCACTTGGAGGCCCAAAGGGGGAAATATTGTATGCTAGGGAGCATGAGGCTGTTTGGTTTAAGGGAAAAAGGTTTTTACCTGCTGTATGGGCCGGTACCCCGGGGGAGCAACAGATTAAACTTCTCAAACCCGCTTTAGATTCAAAAGGTAGAAAAGTTGGAGAAGAATGGTTTACTACAATGAAGGTGGAGGATGCTTTAACAAGGTACCATGAGGCTGGTGCCAAGGCAAAAGCAAGGGTCTTGGAATTGTTGAAGGGACTTTCTTCTGAACTACAAGCAAAGACTAACATTTTAGTATTTGCTTCAATGCTACTAGTTATAGCAAAGGCATTGTTTTCTCATGTGAGTGAAGGCAGAAGAAGGAAATGGGTTTTCCCCACTCTTCTGGAGTTACCCCTGTCTAAGGATGTAAAACCATCGAATGGAGCTGAGGGAATGAAGCTGGTTGGTCTATCACCGTACTGGTTTGATGTAGCAGAAGGTAGTGCTGTAAATAATACTGTTGATATGCAATCACTGCTTCTTTTGACTGGACCAAATGGTGGCGGTAAATCAAGTTTGCTTCGATCACTTTGTGCTGCTGCATTACTTGGAATCTGTGGGTTCATGGTGCCTGCTGAGTCGGCTTTCATTCCACATTTTGATAATATCATGCTTCACATGAAATCTTACGATAGCCCAGCTGACGGAAAAAGTTCATTTCAGGTTGAAATGTCAGAGATCCGATCAATTATCTCGGCCACAAGTAAAAGAAGCCTTGTGCTTATAGACGAAATATGTCGAGGAACAGAAACGGCAAAAGGAACTTGTATTGCCGGCAGCATTGTTGAAACTCTGGATAAAATAGGTTGCCTAGGTATTGTGTCCACTCACTTGAATGGGATATTTAGTTTGCCACTCAAGGCAAAGAACACCATGTTTAAGGCCATGGGAACCGTTTATGTCGATGGCCAAACGAAACCAACTTGGAAACTAATGGACGGGATTTGTAGAGAGAGCCTTGCATTTGAGACTGCTAAGAGAGAAGGAATGCCTGAAACAATAATACAAAGAGCTGAAGAGCTGTACGATTCAGTTTATGCAAAAGAGGTGGTTCCGGCAGAAAATGACTCTAAACTACAAAACATGTGCTCTTATACAAGTTTCAACGGTTCCAATGTATCTCTTCAATCAAATTCTGGTGAAAAAGATTCTGAAAGGGGCAGACCAACAGACCGAATGGAACTCTTGCAAAAGGAAGTTGAGACTGCTGTTACTATGATATGCCAAAGGAAGTTGATAGAGCTGTACAAGAAGGAGAAAACATCAGAACTTACTGAGATTCACTGCGTTCTGATTGGTGCCAGGGAACAACCACCTCCTTCAACTGTAGGTGCTGCATGCGTCTATGTGATGCTAAGGCCTGATAAGAAACTATACGTTGGACAGTCGGATGATTTGGAGGGCCGAGTCCGAACCCACCGTTCAAAGGATGGGATGCAGAAAGCCAATTTCCTTTACTTCACAGTCCCAGGAAAGAGTCTGGCATGTCAACTGGAGACTCTTCTCATCAATCAACTTCCCAACCAAGGATTCCATGTCACTAATGTGGCTGATGGTAAACATAGGAATTTTGGCACATCCTGTCTCTCTCCGGAAAGTGCGACTGTTTGTTAA |
Protein: MYWLATRNAVVFSLHWRSLALLLRSPPCRYSSFTPSPLLLPFGRIFCFKDQRILKRSLRTTRKVKPSNDVLTEKELSNILWWKERLQNCKKPSTVQLVKRLEYSNLLGLDVNLKNGSLKEGTLNWEMLQFKSKFAREILLCRVGEFYEAIGIDACLLVEFAGLNPFGGLRSDSIPRAGCPVMNLRQTLDDLTHNGYSVCIVEEAQGPTNARSRKSRFISGHAHPGSPYVFGLVGLDHDLDFPEPMPVVGISRSARGYCINLVLETMKTYSSEDGLTEEALVTKLRTCRHHHLFLHSSLRHNSSGTCRWGEFGEGGLLWGECTARHFEWFEGNPVTDLLAKVKELYGLDGEVTFRNVTVTSENRPRPLTLGTATQIGAIPTEGIPCLLKVLLPPNCAGLPVTYIRDLLLNPPAYEIASTIQATCKLMSNVTCSIPDFTCVSSAKLVKLLELREANHIEFCRIKNVVDELLLMTKNSELSEILKLLLDPTWVATGLKIDFETLIDECEWTSNKIGEMISLDGESDQKISSSSIVPDDFFEDMESSWKGRVKKVHIGEEFAAVERAAEALTLAVSEAFLPIITRIRATTAPLGGPKGEILYAREHEAVWFKGKRFLPAVWAGTPGEQQIKLLKPALDSKGRKVGEEWFTTMKVEDALTRYHEAGAKAKARVLELLKGLSSELQAKTNILVFASMLLVIAKALFSHVSEGRRRKWVFPTLLELPLSKDVKPSNGAEGMKLVGLSPYWFDVAEGSAVNNTVDMQSLLLLTGPNGGGKSSLLRSLCAAALLGICGFMVPAESAFIPHFDNIMLHMKSYDSPADGKSSFQVEMSEIRSIISATSKRSLVLIDEICRGTETAKGTCIAGSIVETLDKIGCLGIVSTHLNGIFSLPLKAKNTMFKAMGTVYVDGQTKPTWKLMDGICRESLAFETAKREGMPETIIQRAEELYDSVYAKEVVPAENDSKLQNMCSYTSFNGSNVSLQSNSGEKDSERGRPTDRMELLQKEVETAVTMICQRKLIELYKKEKTSELTEIHCVLIGAREQPPPSTVGAACVYVMLRPDKKLYVGQSDDLEGRVRTHRSKDGMQKANFLYFTVPGKSLACQLETLLINQLPNQGFHVTNVADGKHRNFGTSCLSPESATVC |